-
Notifications
You must be signed in to change notification settings - Fork 304
[Feat]: Add Dynamo E2E test profile with GPU support #789
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
✅ Deploy Preview for vllm-semantic-router ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
e4d5f12 to
56977b0
Compare
👥 vLLM Semantic Team NotificationThe following members have been identified for the changed files in this PR and have been automatically assigned: 📁
|
95b5e13 to
a225953
Compare
|
@abdallahsamabd this is cool! Do you happen to have a demo too? We can add the demo to the YouTube channel. |
|
I will create a demo video soon, that will show running the full Dynamo/profile flow which will run all written tests as well, and will show the Dynamo components that being deployed, also will show some manual requests. |
5edfcb0 to
04675ff
Compare
- Add Dynamo profile for GPU-enabled disaggregated vLLM deployment - Add GPU setup integration in Kind cluster (nvidia runtime, library copy, device plugin) - Add DynamoGraphDeployment with Frontend + Prefill + Decode workers - Add Dynamo test cases (health check, GPU utilization, performance) - Fix relative paths in runner.go and profile.go - Re-enable teardown after tests - Remove Dynamo from CI matrix (requires GPU, run manually) - Update README with GPU requirements (3 GPUs minimum) - Remove unused files (namespace.yaml, kustomization.yaml, nvkind-gpu-setup-rhel.md) Requires: VM with 3+ NVIDIA GPUs Run: make e2e-test-dynamo Signed-off-by: abdallahsamabd <[email protected]>
04675ff to
be02009
Compare

Requires: VM with 3+ NVIDIA GPUs
Run: make e2e-test-dynamo
FILL IN THE PR DESCRIPTION HERE
BEFORE SUBMITTING, PLEASE READ THE CHECKLIST BELOW AND FILL IN THE DESCRIPTION ABOVE
-swhen doinggit commit[Bugfix],[Feat], and[CI].Detailed Checklist (Click to Expand)
Thank you for your contribution to semantic-router! Before submitting the pull request, please ensure the PR meets the following criteria. This helps us maintain the code quality and improve the efficiency of the review process.
PR Title and Classification
Please try to classify PRs for easy understanding of the type of changes. The PR title is prefixed appropriately to indicate the type of change. Please use one of the following:
[Bugfix]for bug fixes.[CI/Build]for build or continuous integration improvements.[Doc]for documentation fixes and improvements.[Feat]for new features in the cluster (e.g., autoscaling, disaggregated prefill, etc.).[Router]for changes to thevllm_router(e.g., routing algorithm, router observability, etc.).[Misc]for PRs that do not fit the above categories. Please use this sparingly.Note: If the PR spans more than one category, please include all relevant prefixes.
Code Quality
The PR need to meet the following code quality standards:
pre-committo format your code. SeeREADME.mdfor installation.DCO and Signed-off-by
When contributing changes to this project, you must agree to the DCO. Commits must include a
Signed-off-by:header which certifies agreement with the terms of the DCO.Using
-swithgit commitwill automatically add this header.What to Expect for the Reviews